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CN I The Milky Way has at least twenty-three known satellite galaxies that shine with luminosities 

I I ; ranging from about a thousand to a billion times that of the Sun. Half of these galaxies were 

^ • discovered ^'^ in the past few years in the Sloan Digital Sky Survey, and they are among the 

I . least luminous galaxies in the known Universe. A determination of the mass of these galaxies 

2 '. provides a test of galaxy formation at the smallest scales ^'^ and probes the nature of the dark 

to ! matter that dominates the mass density of the Universe ^. Here we use new measurements of 

1 ""^i ; the velocities of the stars in these galaxies to show that they are consistent with them having 

a common mass of about 10^ Mq within their central 300 parsecs. This result demonstrates 

^ ■ that the faintest of the Milky Way satellites are the most dark matter-dominated galaxies 

CN ! known, and could be a hint of a new scale in galaxy formation or a characteristic scale for 

^ I the clustering of dark matter, 
rn ; 

QQ . Many independent lines of evidence strongly argue for the presence of dark matter in galax- 

O I ies, clusters of galaxies, and throughout the observable Universe ^. Its identity, however, remains 

I a mystery. The gravity of dark matter overwhelms that of the normal atoms and molecules and 

■ hence governs the formation and evolution of galaxies and large-scale structure In the cur- 
is> ' rently favored dark matter models, structure in the Universe forms hierarchically with smaller 

. gravitationally bound clumps of dark matter - haloes- merging to form progressively larger ob- 

5^ ! jects. 



The mass of the smallest dark matter halo is determined by the particle properties of dark 
matter. Dark matter candidates characterized as cold dark matter can form haloes that are many 
orders of magnitude smaller than the least luminous haloes that we infer from observations. Cos- 
mological simulations of cold dark matter predict that galaxies like the Milky Way should be 
teeming with thousands of dark matter haloes with masses ~ IO^Mq, with a steadily increasing 
number as we go to the smallest masses ' A large class of dark matter candidates characterized 
as "warm" would predict fewer of these small haloes However, even for cold dark matter it is 
uncertain what fraction of the small dark matter haloes should host visible galaxies, as the ability 
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of gas to cool and form stars in small dark matter haloes depends on a variety of poorly-understood 
physical processes 

The smallest known galaxies hosted by their own dark matter haloes are the dwarf spheroidal 
satellites of the Milky Way These objects have very little gas and no signs of recent star 
formation. The least luminous galaxies were recently discovered in the Sloan Digital Sky Sur- 
vey (SDSS) ^'^ and follow-up observations have revealed them to be strongly dark matter domi- 
nated ^'21' 22. 

We have compiled line-of-sight velocity measurements of individual stars in 18 of the 23 
known dwarf galaxies in the Milky Way We use these measurements to determine the dynami- 
cal mass of their dark matter haloes using a maximum likelihood analysis The dynamical mass 
is best constrained within the stellar extent, which corresponds to an average radius of approxi- 
mately 0.3 kiloparsecs (kpc) for all the satellites. We determine this mass, Mq.s, by marginalizing 
over a five-parameter density profile for dark matter that allows for both steep density cusps and flat 
cores in the central regions. It is important to note that the observed velocity dispersion of stars is 
determined by both the dynamical mass and the average anisotropy of the velocity dispersion (that 
is, difference between tangential and radial dispersion). The anisotropy is unknown and hence we 
marginalize over a three-parameter stellar velocity anisotropy function that allows us to explore a 
range of orbital models for the stars 

Figure [H shows the resulting determination of M0.3. We find that all 18 dwarf galaxies are 
consistent with having a dynamical mass of 10 solar masses within 0.3 kpc of their centre, despite 
the fact that they have luminosity differences over four orders of magnitude. This result implies 
a dark matter central density of ~ 0.1 Mq/pc^ in these galaxies. Earlier studies suggested that 
the highest-luminosity dwarf galaxies all shared a common mass With larger stellar data sets, 
more than double the number of dwarf galaxies, and more detailed mass modeling, our results 
confirm this earlier suggestion and conclusively establish that the dwarf galaxies of the Milky Way 
share a common mass scale. 

Because of the proximity of the dwarf galaxies to the Milky Way, it is possible that tidal 
effects could change the velocities of stars and thus affect the mass measurements. In the kine- 
matic data, tidal forces could be revealed as a velocity gradient across the observed plane of the 
dwarf 25' 26. We have tested the dwarf galaxies for velocity gradients and have found no conclusive 
evidence of tidal effects (see Supplementary Information). 

We fitted a Mo.s-luminosity relation to the data and obtained M0.3 oc L° '^^=''° '^^. This result 
does not change significantly if we use the luminosity contained within 0.3 kpc rather than the total 
luminosity. The common mass scale of ~ lO^M© may thus reflect either a plummeting efficiency 
for galaxy formation at this mass scale, or that dark matter haloes with lower masses simply do not 
exist. 
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The characteristic density of 0.1 Mg/pc^ may be associated with a characteristic halo forma- 
tion time. In theories of hierarchical structure formation, the central density of dark matter haloes 
is proportional to the mean density of matter in the Universe when the halo formed. The earlier 
the formation, the higher the density. For cold dark matter models, our measurement implies that 
these haloes collapsed at a redshift greater than about 12, or earlier than 100 million years after 
the Big Bang. Measurements of the cosmic microwave background suggest that the Universe 
went from being neutral to ionized at redshift 11 ± 1.4. These dark matter haloes thus formed at 
approximately the same time that the Universe was re-ionized. 

Within the context of the cold dark matter theory, high-resolution cosmological simulations 
can be used to relate the mass within the central regions of the dark matter halo to the depth of the 
gravitational potential well ^^ Simulations show that M0.3 lO^M© (Mtotai/IO'^MQ)"-^^, where 
Mtotai is the mass of the halo before it was accreted into the Milky Way host potential. Thus, 
it is possible that the implied total mass scale of IO^Mq reflects the characteristic scale at which 
supernova feedback or the imprint of the re-ionization of the universe ^^-^^ could sharply suppress 
star formation. 

Perhaps a more speculative, but certainly no less compelling, explanation of the common 

mass scale is that dark matter haloes do not exist with M0.3 below ~ 10^ M©. This implies 
that these dwarf galaxies inhabit the smallest dark matter haloes in the Universe. Warm dark 
matter particles have larger free streaming length than standard cold dark matter particles, which 
implies that density perturbations are erased below a characteristic length scale, resulting in a 
higher minimum mass for dark matter haloes. A thermal warm dark matter candidate with mass of 
approximately 1 kcV would imply a minimum halo mass of lO^M©. Thus, our mass determinations 
rule out thermal warm dark matter candidates with masses less than about 1 kcV, but dark matter 
masses somewhat larger than 1 keV would yield a minimum dark matter halo mass consistent with 
the mass scale we observe. 

Future imaging surveys of stars in the Milky Way will provide a more complete census of 
low-luminosity Milky Way satellites, with the prospects of determining whether astrophysics or 
fundamental dark matter physics is responsible for setting the common mass scale. In particular, 
the masses for the faintest dwarf galaxies will become more strongly constrained with more line- 
of- sight velocity data. This will sharpen the observational picture of galaxy formation on these 
small-scales and provide data around which theories of galaxy formation may be built. 
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Figure 1: The integrated mass of the Milky Way dwarf satellites, in units of solar masses, within 
their inner 0.3 kpc as a function of their total luminosity, in units of solar luminosities. The circle 
(red) points on the left refer to the newly-discovered SDSS satellites, while the square (blue) points 
refer to the classical dwarf satellites discovered pre-SDSS. The error bars reflect the points where 
the likelihood function falls off to 60.6% of its peak value. 
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Supplementary Information 



In this section we present the details of our dynamical mass modeling. We begin by review- 
ing the Jeans formalism for determining the mass of a galaxy using line-of-sight velocities. We 
then discuss our specific treatment of the line-of-sight velocity data and introduce the likelihood 
function that we use in our analysis. Next, we present our results and discuss the implications. 
Finally, we discuss the systematics that may affect the mass modeling. 



1 Mass modeling 



When modeling the stellar distribution of a dwarf galaxy in close proximity to the Milky Way 

(MW), the internal gravitational force from the dwarf must be compared to the external force from 
the Milky Way. Generally, the internal gravitational force from the dwarf is ~ a'^/Rg, where a is 
the velocity dispersion and Rg is the half-light radius of the dwarf. The external tidal force from 
the MW potential is ~ (220 km/s)^i?s/L>^, where D is the distance to the dwarf from the center 
of the MW, and 220km/s is the approximate rotation speed of the MW in the outer regions of 
the halo where the dwarfs are located. The most luminous of the MW dwarfs have half-light radii 
of Rs ~ 400 pc, and the least luminous of the dwarfs have half-light radii of i?^ ~ 10 — 100 
pc. The velocity dispersions vary in the range a ~ 5 — 15 km s"^. Comparing the internal and 
external forces on dwarfs in the observed distance range of ~ 20 — 250 kpc, we find that the 
internal gravitational forces are typically larger by ~ 100. Note that this estimate does not exclude 
the possibility that the dwarfs have been tidally stripped in the past; it does, however, allow us to 
proceed safely with the assumption that the surviving stellar distributions trace the local potential. 



Previous studies of some Milky Way dwarf spheroidals (dSphs) show that these objects ex- 
hibit no streaming motions Making the further assumption that these systems are in steady 
state, the three-dimensional radial velocity dispersion of the system, ar, is given by the Jeans 
equation 

rf(p^a2) .GM{r) , 2 

r ' , = -p^{r) ^ - 2/3 r (1) 

ar r 

This equation is valid if the potential of the system is spherically-symmetric, which has been shown 
to be a good description of dark matter satellites in a host potential To convert to the observable 
quantity, we must integrate the solution to the Jeans equation along the line-of-sight. Performing 
this integration gives 

* ^ ^ h{R) Jr \ ^r^ J - i?2 

Here R is defined as the projected radius in the plane of the sky. The three-dimensional stellar 
density profile is defined by p*(r), which is determined from the projected stellar distribution, 
h{R). The stellar velocity anisotropy is defined as P — 1 — cr^(r) / cr^{r), where ae is the tangential 
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component of the velocity dispersion. We now discuss in turn our parameterizations of the different 
functions entering in equation [21 



Stellar Surface Density It is standard to fit the stellar surface densities of the systems we study 
to either Plummer or King profiles. The surface density for the King profile is ''^ 



4ing(^) = k 



-1/2 



-1/2- 



1 + 



Urn 



(3) 



which results in a de-projected three-dimensional density of 



Pkii 



- cos ^ z — yl 

z 



(4) 



'"^^ ' 7rr,[l + (n,„/r,)2]3/2^2 
1 + r'^ /r1) /(I + rf^^/rl). The normalization constant, k, for the King profile thus is 



where z'^ 

irrelevant when applying the Jeans equations. The King profile depends on two parameters, r/ 
and Tr. 



The surface density for the Plummer profile is given by 



UR) = (5) 

^'^ ^ 3 [1 + (i?/rpi)f 



which results in a de-projected three-dimensional density of 



The only relevant free parameter in the Plummer profile is Vpi. 



In Table \\\ we show the respective fits to the surface density for each of the dwarf satel- 
lites Many of the well-known dSphs are well-fit by King profiles, although for some galaxies 
the King profile fits have been updated to account for the observed distribution of stars in the outer 
regions As is seen, the majority of the new satellites are well-fit by Plummer profiles. In the 
instances where both Plummer and King profiles have been fit to the data, we find that the exact 
form of the fit does not strongly affect the results we present below. 



Velocity Anisotropy We assume that both tangential components of the velocity dispersion are 
equal, aj = cr|. Many of the systems we study are observed to have multiple stellar populations 
so there is no reason why the anisotropy should be constant throughout the galaxy. To account for 
radial variation, we parametrize the anisotropy profile as 

2 

P{r) = iPoo-Po)^^ + Po. (7) 

The velocity anisotropy profile of this form is thus described by an asymptotic inner value, /3o, and 
asymptotic value near the edge of the halo. Poo, and a scale radius, r^. We place the constraints on 
Po and Poo such that P{r) < 1 for all radii. 
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Dark Matter Density Profile To model the dark matter mass distribution, we use a density profile 
of the form 



The asymptotic inner slope is determined by a, the asymptotic outer slope is determined by c, and 
the transition between these two regimes is determined by b. The scale density is defined as po, and 
the scale radius is defined as tq. In all cases, the line-of-sight velocities are not able to determine 
the parameters a, b, c, though as we see below, the data do strongly constrain the mass and the 
density of these systems at a characteristic radius 

For some regions of parameter space, the density profile in equation [8] has an infinite mass, 
so it cannot represent a physically reasonable dark matter halo. To ensure that the mass is finite in 
all of parameter space, we weight equation [8] as p(r) — > p(r) exp^''/'"'^"'. We determine the cut-off 
radius, r^ut, from the standard Roche-limit criteria. 



where Mhaio is the total mass of the satellite, (Tmw is the velocity dispersion of the Milky Way 
at the position of the satellite, and D is the distance to the center of the satellite. To establish a 
conservative upper limit on rcut, in particular allowing it to span the largest physically possible 
range, for all of the satellites we take Mhaio = 10^ M0 and (Tmw = 200 km s~^. 

2 Data Modeling 

From a data set consisting of line-of-sight velocities, our goal is to determine the constraints on 
any of the parameters in equation^ To evaluate the constraints on these parameters, we undertake 
a maximum likelihood analysis and construct the probability distribution for obtaining a set of 
line-of -sight velocities in a dwarf galaxy. The goal of this section is to present and discuss our 
implementation of the maximum likelihood analysis. 

Likelihood Function The dispersion in the velocity at a given position is the sum of two compo- 
nents: 1) the dispersion from the velocity distribution function, and 2) the random error stemming 
from the uncertainty in the measurement of the velocity of each star. Motivated by theoretical 
modeling and observations of line-of-sight velocities ^, we take both of these distributions to 
be Gaussian. The dispersion stemming from the velocity distribution function is determined by 
the set of parameters in equations |7] and [8] that govern the dark matter halo model and the velocity 
anisotropy. We refer to this vector set of parameters as 9. The probability for obtaining a set of 
line-of-sight velocities, v, given the theoretical parameters, is P(v|n, at). Here u is defined as the 
systemic velocity of the galaxy, and at is determined from equation [2l It is thus implied that at is 
a function of the parameters 9. With the above assumptions, the result for P(v|n, at) is a Gaussian 



p(r) 



(r/ro)''[l + (r/ro)'']^'^-")/^' 



(8) 




(9) 
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distribution. 



N 

P(v|n,(7t) = n 

i=l 



exp 



1 (Vi - u) 



erf 



(10) 



The product is over the number of stars in the galaxy with line-of-sight velocity measure- 
ments. The total variance at the projected radius R (at the position of the i*'' star) is thus given 
by erf = i + cr'^^i, where erf ^ refers to the variance from the theoretical distribution, and crf^ ^ is 
the variance from the measurement uncertainty. In writing equation [10] we have assumed no cor- 
relations between both the theory and measured dispersions. Since we are assuming no streaming 
motion, ctj j is replaced with equation |2l According to Bayes theorem, equation [TOl is proportional 
to the probability of the parameters given the data, i.e. P(v|m, at) oc P{u, crt|v). When considered 
as a function of the parameters, equation [10] can be then defined as the likelihood function for the 
parameters, C{9). 



It is important to note that the assumption of Gaussianity for the intrinsic velocity distri- 
bution function is only an approximation to the true phase space distribution. The true intrinsic 
distribution function will depend on the exact form of the density of the stellar population and the 
potential of the background dark matter halo. Taking the stellar distribution to be a King profile, 
the Gaussian approximation provides a good estimation of the true distribution, though there may 
be some deviations from Gaussianity in the outer most regions of galaxies From an observa- 
tional perspective, the distribution in velocities for nearly all of the dwarf satellites we consider is 
well-described by a Gaussian distribution at the ~ 2 — Scr level. For all satellites, we have found 
that tests of deviation from Gaussianity in the data have proved inconclusive. 



In addition to deviations due to the nature of the local gravitational potential, from a system- 
atic perspective the observed velocity distribution may deviate from Gaussianity if there is large 
contamination to the sample from interloping stars or from stars not bound to the galaxy because of 
tidal interactions. To accurately model the mass distribution of a galaxy with equations[I]and l2]and 
line-of-sight velocities, interloping and unbound stars must be properly identified. Positional and 
photometric criteria provide a means for identifying these interloping and unbound stars ^. Numer- 
ical simulations have also been performed to aid in the identification of contaminating stars 
Tagging stars as outliers based simply on their presence in the tails of the velocity distribution intro- 
duces an intrinsic bias into the determination of the mass, so excluding these stars as non-members 
must be done with caution. Once interlopers are cut, it is observed that the mass determined from 
the full-data sets is accurate to ~ 20% This error is most relevant, however, in determining 
the total halo mass. The presence of interlopers is even less significant when determining the mass 
at radii nearer to the center of the halo, as we do below. 
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3 Results 



We now turn to application of the analysis above and determine the best fitting masses for the 
Milky Way dwarf satellites. The primary result will be integrated masses within radii of 0.1 and 
0.3 kpc. The latter provide the main results of our analysis. The former is convenient because, 
for a few satellites, it requires less of an extrapolation of the dark matter halo beyond the observed 
stellar distribution. Our choice of 0.3 kpc as the characteristic radius probes the mass further in 
the interior of the haloes than the characteristic radius of 0.6 kpc used in a previous study This 
latter choice of 0.6 kpc was better suited for comparing the masses of only the pre-SDSS, classical 
dSphs, which are on average more extended than the new SDSS population. For dwarf satellites 
with over a hundred line-of-sight velocities, we find the strength of the constraints on the mass 
within 0.3 and 0.6 kpc to be typically similar. 

The mass within a given radius, m, is obtained by integrating the probability distribution in 
equation [To] over the model parameters. 



In practice, determining C{m) then requires an integration over all of the free parameters in equa- 
tions |7] and [8l In equation [HI M is determined by the halo model parameters at the given point in 
parameter space. Note also that in equation[TT]we have ignored the normalization of the likelihood, 
which does not depend on the model parameters. To ease in quoting confidence limits, we simply 
normalize the peak of the resulting integrated likelihood function to unity. 

We assume uniform priors on the model parameters over their respective ranges. We have 
tested the assumption of uniform priors by also considering Gaussian priors on each parameter 
over their respective allowed ranges. For either assumed prior, our mass constraints are found 
to be robust. For the dark matter halo parameters, uniform priors are chosen over the following 
ranges: 0.01 < tq < 10 kpc, < a < 1.5, 0.5 < 6 < 1.5, and 2 < c < 4. We find that all 
of these parameters are not well-constrained by the line-of-sight velocity data. We also include a 
cut-off radius in the halo profile as in equation [9l which, as discussed above, serves to makes the 
truncation of the halo sharper than the power law behavior implied by the outer slope c. 

For the radially-variable anisotropy model, the following ranges are taken: 0.1 < < 10, 
— 10 < Poo < 1, and —10 < (3q < 1. The choice of a flat prior on the velocity anisotropy parame- 
ters does in principle give a stronger weight to more tangential orbits, though we find that including 
a prior on the anisotropy that more uniformly weighs radial orbits does not affect the results that we 
present. Though we have assumed a radially-variable anisotropy profile in equation[7l we find that 
our resulting constraints on the mass are equivalent to those obtained in a model where the velocity 
anisotropy is constant in radius. The only instance in which our mass constraints weaken signif- 
icantly is in the physically unlikely scenario that the anisotropy profile has a very steep transition 
right around the King (or Plummer) radius. 




(11) 
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Figure |2] depicts the main results of our analysis: the mass likelihood functions within 0.3 
kpc for 18 of the Milky Way dwarfs. The classical, pre-SDSS dwarfs, which have more measured 
line-of-sight velocities spread over a larger radial distance, have likelihoods that are much more 
strongly constrained. Figure [2] shows that the M0.3 values are tightly bunched around ~ 10'' M0. 
The only dwarf that displays a large tail in its likelihood at ~ l-a to small M0.3 is Leo IV, which 
may be related to the shorter integration time spent on Leo IV as well as the relatively small sample 
of stellar velocities in this galaxy. 

The stellar luminosities for the Milky Way dwarf satellites span a range of approximately four 
orders of magnitude: from the most luminous satellite (Fornax, ~ 10^ L©), to the least luminous 
(Willman 1, Ursa Major II, Coma Berenices, and Segue 1: ~ 10'^ Lq). Since typical stellar mass- 
to-light ratios are of order unity, we can immediately deduce that all of these dwarfs are strongly 
dark matter dominated within the limiting radius of their stellar distributions. Nearly all of these 
dwarfs have mass-to-light ratios within their inner 0.3 kpc of Mq.s/Lq > 10. The most luminous 
systems, including Fornax, Sculptor, and Leo I, have the smallest total Mo.s/L©. Because our 
analysis determines the total dynamical mass, the stars in these galaxies likely make a significant 
contribution to the measured M0.3. However, when casting the mass-to-light ratio in terms of total 
halo mass, Mtotah even for these most luminous dwarfs we have Mtotai/-Z^ > 100 

In Tablefll we show the numerical results for the mass within 0. 1 and 0.3 kpc for each dwarf 
satellite, along with other various observed properties. The results of Table [T] also show that the 
masses within 100 pc are well-constrained, implying a common mass of ~ 10^ Mq within this 
radius. As mentioned above, for several dwarf satellites, in particular Segue 1, Willman 1, and 
Coma Berenices, determining the mass within 0.1 kpc requires less of an extrapolation beyond the 
limiting radius of the stellar distributions. In these instances, the mass within 0.1 kpc is in good 
agreement with masses determined from a single-component, mass-foUows-light analysis 6.21,22,39 

Of the newly-discovered satellites, we have not displayed results for either Bootes I or Bootes 
II. Line-of-sight velocities for Bootes I have been published though for ease of comparison 
we have chosen to use common data sets. Our initial estimate does in fact also show that Bootes I 
is consistent with the M0.3 — 10'' Mq mass scale at ~ 2 — a. We are currently unable to analyze 
Bootes II because it was discovered within the past year and does not yet have measured line-of- 
sight velocities. Of the classical, pre-SDSS satellites, we do not determine the mass for the Large 
Magellanic Cloud (LMC), Small Magellanic Cloud (SMC), or Sagittarius. It is likely that both 
the LMC and SMC have significant baryonic contribution to the their respective masses within 0.3 
kpc, so a detailed determination of the halo mass of each galaxy will require accurate modeling of 
this contribution. Sagittarius has published line-of-sight velocities though because it is in the 
process of tidal disruption, streaming motions will strongly affect the determination of its mass, 
making the above analysis not reliable. 

As discussed above, for the respective stellar distributions, we use the King or Plummer 
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profiles given in Table \T\ For several of the classical dwarfs, it has been found that in the outer 
regions of the galaxy a faint population of stars exists that has a surface density that falls off less 
steeply than either the King or Plummer profile Interpreting these stars as bound to the galaxy 
would require an increase in the total halo mass. However, we find that the presence of these stars 
has little effect on the mass within 0.3 kpc, as this latter quantity is primarily determined by the 
population of stars that are well-described by the respective parameters in Table [1] 



It is remarkable to note that, when considering both the wide range of scale radii in Table \T\ 
and the range of velocity dispersions, the common mass scale persists. Though determination of 
the masses requires a numerical solution to the Jeans equation as described above, we can gain 
some insight into the scaling of the mass with both the velocity dispersion and the scale radius by 
considering the limit in which P = 0, the surface density of stars is given by a Plummer profile, 
and the velocity dispersion is constant as a function of radius. The latter simplification is a good 
approximation for many of the dwarfs. Under these assumptions, the mass at any radius is given 



by M(r) ~ 1.2 x 10^ 



km/s 



l+y- 



, where y = r/vpi. Examining this formula, we see 



that, for a fixed projected velocity dispersion, the mass will increase if the Plummer scale radius 
is reduced. Qualitatively, this scaling can be understood by noting that it takes a deeper potential 
well to confine stars with a larger dispersion to a small radius. We note that all of our results are 
consistent with the minimum mass obtained from the virial theorem for example for Willman 

2 X 10^ Mq. 



1 we obtain a minimum mass of ~ 3 x 10^ Mq, and for Fornax we obtain a minimum mass of 



It should be stressed again that in our dynamical mass analysis we have assumed that the 
stars are orbiting in a spherically symmetric potential. This is a reasonable assumption because 
numerical simulations show that potentials of these dark matter haloes are close to spherical 
with triaxial axis ratios of 0.8 to 0.9. Dispersion and bias introduced in the dynamical mass deter- 
minations at the level of 10-20% are not important at the present stage but will be as data sets get 
larger for the fainter satellites. 



4 Tidal Disruption, Rotation, and Binary Contamination 

Equation[TO]assumes that rotational motion or external tidal forces do not contribute to the observed 
line-of-sight velocities. However, this may not be a complete description of the dwarf satellites, 
as it is very likely that their dark matter haloes have been affected by tidal interactions in the past. 
Although the simple scalings above indicate that it is unlikely tidal forces dominate the dynamics of 
these systems, it is important to quantify any deviations from steady-state dynamics more precisely. 
The goal of this section is to use the line-of-sight velocity data to provide conservative tests for 
both rotation and tidal disruption. 

Tidal Disruption Tidal forces induce a gradient in the line-of-sight velocities across a galaxy 

To test for a velocity gradient, we describe the stars in terms of their projected radial distance, R, 
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from the center of the dwarf, and the azimuthal angle of each star, 0, relative to a fixed coordinate 
system. Streaming motion would thus appear as a velocity gradient about an angle (f)o, which is 
determined from the data In the presence of streaming, the systemic velocity in the likelihood 
function of equation [10] is replaced as 

u u + Asm{(j)i + (f)o). (12) 

Here A represents the amplitude of the streaming motion, (po is the (projected) axis of the streaming 
motion, and 0j is the azimuthal angle of the star. It is important to note that, in principle, 
replacing the systemic velocity with equation [121 provides only an approximation to streaming 
motion that serves to pick out a dipolar term in the velocity field. Higher order multipoles may 
exist as a result of tidal forces; however for our purposes we assume these higher order terms to be 
sub-dominant to the leading dipolar term given in equation [T2| 

A non-zero detection of A suggests the presence of tidal forces or rotational motion. We 
have examined the line-of- sight velocities for each system and have determined the likelihood for 
the amplitude A after marginalizing over all of the halo parameters, anisotropy parameters, as well 
as 00- In Figure [3l we show the resulting likelihood functions for A for three systems: Willman 
1, Coma Berenics, and Ursa Major II. We show these systems as examples because they are the 
amongst the nearest to the Milky Way, and thus may be plausible candidates for tidal disruption. 
As is seen, there is no significant detection of A. For the Willman 1 sample we use, the mean value 
of A is ~ 2 km s^^, but this object is still consistent with no rotation at ~ 1 — cr. 

In all of the remaining newly-discovered satellites, we find no statistically significant detec- 
tion of A. In nearly all cases, we place strong upper limits on A to be a fraction of the intrin- 
sic dispersion of each system. We note that the inconclusive detection of velocity gradients in 
Coma Berenices and Ursa Major II differ from the results reported in Simon and Geha ^, where 
a weighted mean of the velocities on each side of the position angle showed an apparent velocity 
gradient. Here we find that accounting for correlations with higher order velocity moments washes 
out the detection of a velocity gradient in each of these systems. These results imply that rotational 
motion or tidal disruption in the above parametrization likely does not significantly affect the mass 
modeling. 

We note that, when interpreting the parametrization in equation [12] as a rotational signal, it 
is only an approximation to the true three-dimensional rotation. To construct a more physically- 
plausible model one would need to replace equation [121 with a model that depends on at least 
two angles that describe both the inclination of the rotation axis and the line-of-nodes of the sys- 
tem. Additionally, flattening of the stellar distribution must be accounted for by modifying the 
spherically-symmetric Jeans equation. At the very least, these effects will increase the uncertainty 
on the rotation amplitude, so in this sense our errors on the rotation amplitude are likely too strict. 
Our initial estimates show that, when adding an additional angle to account for three-dimensional 
rotation, the error on the amplitude of the rotational motion increases by a factor of about two. 
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Perspective Rotation As the satellites are extended objects on the sky, the line-of-sight velocity 
will vary as we move across the object. As we move farther from the center of the system, the 
tangential motion of the object will contribute to the measured line-of-sight velocity. This effect is 
known as "perspective rotation," and the net result is that a non-rotating object will appear to have 
a velocity gradient across the system 

Perspective rotation can be simply parametrized by replacing the systemic velocity in the 
likelihood function as m — > m + v^xjD -|- Vy-y/D, where x and y are the projected positions on the 
sky and and Vy are the tangential motions in these directions. For the purposes of our present 
analysis, we are interested in determining if the addition of the perspective rotation term will alter 
our mass estimates derived above. We find that, in all data sets, the masses we determine are 
robust even after accounting for perspective rotation. A full analysis of a given data set will require 
addition of both the streaming motion term in equation[T2land the perspective rotation term; adding 
both of these will result in further degeneracies that will make each separate effect more difficult 
to extract. 

Contamination from Binary Stars A final effect we cannot include at present that may introduce 
a systematic in determining the masses is the contribution of internal binary star motion to the ve- 
locity dispersion. For binaries with orbits of order 100 AU, the relative binary speed is comparable 
to the intrinsic velocity dispersion of the satellite galaxy. Thus the bias due to binaries depends on 
the fraction of stars that are binaries with orbits smaller than about 100 AU for the stellar popula- 
tion whose velocities are being measured. For two of the most luminous satellites, Draco and Ursa 
Minor (which also have a higher velocity dispersion), a significant effect of internal binary motion 
on the measured velocity dispersion has been shown to be unlikely A detailed study for faint 
dwarfs will require repeat observations of stars a year or two apart; this should be available in the 
near future. Here we simply note that if a significant fraction of the measured dispersion was due 
to binary motion, then the distribution of velocities would deviate significantly from the observed 
Gaussian distribution. 
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Satellite 


Distance [kpc] 


rking, rpi [kpc] 


rii,n [kpc] 


Luminosity [10" Lq] 


Mo.i [10' Mq] 


Mo.3 [10' M0] 


Ursa Minor (Umi) 


66 


0.30 


1.50 


0.29 






1 79+0-y''' 

^■'^-0.59 


Draco (Dra) 


80 


0.18 


0.93 


0.26 




09+°-^° 
"■"^-0.02 


1 07+0.20 
^■»'_0.29 


Sculptor (Scl) 


80 


0.28 


1.63 


2.15 




1 5+0-28 


1 20+°-" 


Sextans (Sex) 


86 


0.40 


4.00 


0.50 




06+° °^ 


"57+0-45 


Carina (Car) 


101 


0.26 


0.85 


0.43 




n /f8+0-07 


, cy+O.ig 
^•^'-0.10 


Fornax (Fnx) 


138 


0.39 


2.70 


15.5 




12+007 


1 14+0.09 
^•^^-0.12 


Leo II 


205 


0.19 


0.52 


0.58 




16+003 


1 43+0.23 


Leo I 


250 


0.20 


0.80 


4.79 




06+"" 


1 45+0.27 


Segue 1 (Seg 1) 


25 


0.031 




3.4 X 10" 


-4 


U.OO_g 24 


1.58+^'^^ 


Ursa Major II (Uma II) 


32 


0.127 




4.0 X 10~ 


-3 


31+018 


1 09+0-89 
r.uy_g 44 


Willman 1 (Wl) 


38 


0.025 




1.0 X 10" 


-3 




,-, 77+0.89 
'-0.42 


Coma (Com) 


44 


0.064 




3.7 X 10- 


-3 


19+009 


p. 79+0.36 
^•'^-0.28 


Ursa Major I (Uma I) 


106 


0.308 




1.4 X 10" 


-2 


34+0 15 


1 10+0-™ 
^•^'J-0.29 


Hercules (Her) 


138 


0.321 




3.6 X 10- 


-2 


19+010 


72+0-51 
'^•'^-0.21 


CVII 


151 


0.132 




7.9 X 10- 


-3 


19+0-14 


n 7(1+0.53 
^•"J-0.25 


Leo IV 


158 


0.152 




8.7 X 10- 


-3 


12+°-" 


39+0.50 

U.O»_g 29 


CVI 


224 


0.554 




2.3 X 10- 


-1 


34+0-20 


1 40+"-^* 
^•™-0.19 


Leo T 


417 


0.170 




5.9 X 10- 


-2 


39+0.25 


1 oQ+0.88 
l-^'J-0.42 



Table 1: Table of properties of Milky Way satellites. Where shown, in pai'enthesis we list the abbreviation used in Figure |2] Dwaii' satellites 
listed below the horizontal dividing line denote the newly-discovered population of SDSS dwarfs. For the classical satellites, we use King profiles, 
described by the King core radius, ri^i„g. For the new satellites, we use Plummer profiles, described by r-pi. The Hmiting radius, r-ji^, is defined 
as the limiting radius for the King profile. The error bars on the mass values reflect the points at which the likelihood function falls off to 60.6% of 
its peak value on either side. The luminosities for the newly-discovered satellites are taken from Ref. 
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Figure 2: The likelihood function for the integrated mass within 0.3 kpc for 18 of the Milky Way 
satellites. We marginalize over all parameters as described in the text. 
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Figure 3: The likelihood function for the rotation amplitude for Willman 1, Coma Berenices, and 
Ursa Major II. 
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